Estimating the age of renal tumors 
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Abstract 



We present a Bayesian method for estimating the age 
of a renal tumor given its size. We use a model of 
tumor growth based on published data from observa- 
tions of untreated tumors. We find, for example, that 
the median age of a 5 cm tumor is 20 years, with in- 
terquartile range 16-23 and 90% confidence interval 
11-30 years. 



1 Introduction 

For some cancer patients it is important to estimate 
a tumor's date of formation; for example, benefits 
provided by the U.S. Department of Veterans Affairs 
depend on whether it is likely that a tumor formed 
while the patient was in military service, among other 
considerations. Doctors are currently unable to pro- 
vide a statistical estimate of when a tumor formed. 

For renal cancers, we have reliable measurements 
for the rate of growth during a period of observation; 
this paper presents a method to use this data to esti- 
mate the distribution of ages for a renal tumor based 
on size at diagnosis. 

1.1 Prior work 

Several studies report growth rates for patients with 
untreated renal tumors [1-11]. We use this data to 
estimate the distribution of growth rates that is the 
basis of our model. 
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Figure 1: Distribution of RDT from Zhang et al. 



Some studies report that different subtypes grow at 
different rates, but Zhang et al [11] conclude "Growth 
rates in renal tumors of different sizes, subtypes and 
grades represent a wide range and overlap substan- 
tially." Other studies are consistent with this conclu- 
sion, so our growth model does not take into account 
tumor subtype or grade. 



2 Methodology 

We use data from Zhang et al [11] to estimate the 
distribution of growth rates during a period of obser- 
vation. We use this distribution to generate tumor 
growth histories. These simulations yield P(d\t), the 
distribution of diameter, d, as a function of the age 
of the tumor, t. Then we compute P(t\d), the distri- 
bution of age as a function of size. 
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Confidence interval for age vs diameter 
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Figure 2: 100 simulations of tumor growth. 



Figure 3: Percentiles of time since to- 



2.1 Distribution of growth rates 

Zhang et al report reciprocal doubling times (RDT), 
in units of doublings per year, for 53 "patients who 
underwent nephrectomy from 1989 to 2006 who did 
not receive preoperative chemotherapy or radiation 
therapy and underwent at least two preoperative con- 
trast material-enhanced CT examinations (at least 3 
months apart)" . 

We use data from their Figure 3 to construct 
the cumulative distribution function (CDF) of RDT, 
shown in Figure 1. The line shows a model of the data 
as a mixture of exponential distributions: with prob- 
ability p = 0.35 we generate a negative RDT with 
A2 = 5.0; otherwise we generate a positive RDT with 
Ai = 0.79. The model fits the data well, allowing us 
to interpolate between data points and characterize 
tail behavior. 

2.2 Simulated growth histories 

To simulate tumor growth, we start at a hypothet- 
ical time, t , when tumor volume is Vq = 0.01 mL 
(diameter 0.27 cm), and repeat these steps: 

1. Choose a random value from the modeled distri- 
bution of RDT. 

2. Compute the volume at the end of the interval, 
Vj+i = 2 hRDT Vi, where h is the duration of the 



interval in years. In Zhang et al, the median 
time between initial and final scans is 245 days, 
so we used this value for the interval, h. 



3. Repeat until volume exceeds V„, 
(diameter 20 cm). 
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Figure 2 shows the result of 100 simulations. The 
vertical scale is d, diameter in cm. Using these sim- 
ulations, we compute P(d\t) the distribution of size 
given t, time since to- 

2.3 Serial correlation 

In our model of tumor growth, the growth rate during 
each interval is independent of previous growth rates. 
It is plausible that, in reality, tumors that have grown 
quickly in the past are more likely to grow quickly. 

If this correlation exists, it affects the location and 
spread of our results. For example, running simula- 
tions with p = 0.4 increases the estimated median 
age by about a year, and the interquartile range by 
about 3 years. 

However, if there were a strong serial correlation 
in growth rate, there would also be a correlation be- 
tween tumor volume and growth rate, and prior work 
has shown no such relationship [7] [11]. 

There could still be a weak serial correlation, but 
since there is currently no evidence for it, we report 
results based on simulations with p = 0. 
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Table 1 : Percentiles of time since to • 



2.4 Inverse probabilities 

In Figure 2 the dashed line at 10 cm illustrates our 
method for inverting P(d\t). When a simulated his- 
tory crosses this line, that represents a point in time 
when a tumor could be observed at this size. Since we 
know the duration of each history, we can construct 
the distribution of ages for a tumor observed at this 
size. Repeating this computation for each size, we 
construct P(t\d). 



3 Results 

Figure 3 shows the confidence interval for t as a func- 
tion of size. The points are data from simulation, 
which produces some variability due to discrete ap- 
proximation. The lines arc fitted to the data. For 
each size, we compute the median of t, interquartile 
range, and 90% confidence interval. Table 1 shows 
these results numerically, for selected sizes. 



4 Conclusion 



lower bound on the tumor's age. This information is 
potentially useful to patients, doctors and insurers. 
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Using Table 1 we can look up the size of a tumor 
and find the distribution of time since to, which is a 



